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O ' Abstract 

We derive new results for the performance of a simple greedy algorithm for finding large inde- 
pendent sets and matchings in constant degree regular graphs. We show that for r-regular graphs 
with n nodes and girth at least g, the algorithm finds an independent set of expected cardinality 

f{r)n — 0( 1^1 — n), where /(r) is a function which we explicitly compute. A similar result is estab- 
lished for matchings. Our results imply improved bounds for the size of the largest independent set 
in these graphs, and provide the first results of this type for matchings. As an implication we show 
that the greedy algorithm returns a nearly perfect matching when both the degree r and girth g are 
large. Furthermore, wc show that the cardinality of independent sets and matchings produced by 

fj ' the greedy algorithm in arbitrary bounded degree graphs is concentrated around the mean. Finally, 

we analyze the performance of the greedy algorithm for the case of random i.i.d. weighted indepen- 
dent sets and matchings, and obtain a remarkably simple expression for the limiting expected values 

^ . produced by the algorithm. In fact, all the other results are obtained as straightforward corollaries 

[-«-. ' from the results for the weighted case. 
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1 Introduction 

O I 1.1 Regular graphs, independent sets, matchings and randomized greedy algo- 
rithms 
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An r-regular graph is a graph in which every node has degree exactly r. The girth g of a graph is the 
k> , size of the smallest cycle. Let G{g, r) denote the family of all r-regular graphs with girth at least g. 
H \ For a graph G, we denote the set of nodes and edges by V{G) and E{G), respectively. A set of nodes 
/ is defined to be an independent set if no two nodes of / are adjacent. For a graph G, let I{G) denote 
(any) maximum cardinality independent set {MIS) of G, and |2r(G)! its cardinality. Throughout the 
paper we will drop the explicit reference to the underlying graph G when there is no ambiguity. For 
example we use I instead of X(G) or V instead of V{G). 

Suppose the nodes of a graph are equipped with some non- negative weights Wi, 1 < i < n = \V\. 
The weight W[I] of a given independent set / is the sum of the weights of the nodes in /. When the 
nodes of a graph are equipped with weights which are generated i.i.d. using a continuous distribution 
function F(t) = P(Wj < t) with non-negative support, we denote by 2w the random unique with 
probability 1 (w.p.l) maximum weight independent set (MWIS) of G. 
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A (partial) matching is a set of edges M in a graph G such that every node is incident to at most one 
edge in M. For a graph G, let M. denote (any) maximum cardinality matching {MM) of G. Suppose 
the edges of a graph are equipped with some non-negative weights We,e G E. The weight W[M] of 
a given matching M is the sum of the weights of the edges in M. When the edges of a graph G are 
equipped with weights generated i.i.d. using a continuous distribution function F with non-negative 
support, we denote by My/ the random unique (w.p.l) maximum weight matching {MWM) of G. 

In this paper we analyze the performance of a simple greedy algorithm, which we call GREEDY , 
for finding large independent sets and matchings. The description of the GREEDY algorithm is as 
follows. For independent sets, GREEDY iteratively selects a node i uniformly at random (u.a.r) from 
all remaining nodes of the graph, adds i to the independent set, deletes all remaining nodes adjacent to 
i and repeats. Note that while the underlying graph is non-random, the independent set produced by 
GREEDY is random as it is based on randomized choices. For MWIS, GREEDY iteratively selects 
the node i with the greatest weight from all the remaining nodes, adds i to the independent set, deletes 
all the remaining nodes adjacent to i and repeats. Note that when acting on a fixed weighted graph, 
the action of GREEDY is non-random. In this setting, the randomness will come from the fact that 
the weighting itself is i.i.d. For matchings GREEDY operates similarly, except that it chooses edges 
instead of nodes, and deletes edges incident to the chosen edge. 

Let TQ{M.Q) denote the random independent set (matching) returned by GREEDY when run on 
an unweighted or (randomly) weighted graph G, depending on context. Denote by VK[X^](I^[7W^]) 
the weight oiZQ{J^Q) (for the weighted case), and by |X^|(|A1^|) the respective cardinalities (in the 
unweighted case). Our goal is obtaining bounds on the expectation and variance of \TQ\, \M.Q\, W[IQ], 
W[A4Q], where the latter two will be considered for the case of i.i.d. continuous non-negative weight 
distributions. One of the motivations is to derive new lower bounds on largest independent set in 
constant degree regular graphs with large girth. 

1.2 Summary of our results and prior work 

Our main results are Theorems ll|2l which provide remarkably explicit upper and lower bounds on the 
expected weight of the independent set and matching produced by GREEDY in a regular graph of 
large fixed girth when the weights are generated i.i.d. from a continuous non-negative distribution. 
Since the gap between the upper and lower bound is of the order ?« (r — ly''^ /{g/2)\, we also obtain 
the limiting expression for the weight of the independent set and matching produced by GREEDY in 
a regular graph when the girth diverges to infinity. These results are Corollaries ll|2[ 

As a corollary we obtain upper and lower bounds on E[|X^|] and E[|A4^|], by considering a uniform 
distribution which is highly concentrated around 1 . These results are stated as Theorems I4|51 Again 
the gap between the upper and lower bounds is of the order ^ [r — l)^''^ /{g/2)l and we obtain a limiting 
expression when the girth diverges to infinity, as stated in Corollaries [3l O While Corollary [5] is a new 
result. Corollary [3] is not. This result was recently established by Lauer and Wormald }LWj using 
a different approach called the 'nibble' method. Thus our Theorem [5] can be viewed as an explicit 
finite girth correction to the limiting result (Corollary [3]) derived earlier in |LWj and proved here using 
different methods. 

Our results on the performance of the GREEDY algorithm, as well as the results of [LW] . are 
motivated by the problem of obtaining lower bounds on the size of the largest independent set in 
regular graphs, and specifically regular graphs with large girth. The history of this problem is very 
long [HS82] ■ [She83| . [S"he91| . [She95| with |LW| being the latest on the subject. In particular, the lower 
bounds obtained in [LW] are the best known for the case r > 7 and sufficiently large girth, and in this 
range they beat previous best bounds obtained by Shearer |She91] . Although these bounds are the best 



known as the girth diverges to infinity (for any fixed r), the bounds given in [LW| for any fixed girth 
are very difficult to evaluate, as they are given implicitly as the solution to a large-scale optimization 
problem. Our bounds match those of [LWj for any fixed r as the girth diverges to infinity, and give 
simple explicit bounds for any fixed girth as a finite girth correction of the order ss (r — l)^'"^ /{g/2)\. 
In addition, our bounds are superior for several instances discussed in jLW] where bounds were derived 
numerically by lower-bounding the aforementioned large-scale optimization problem. The details of this 
comparison are presented in Section [6l 

Our corresponding results for matchings are new, both the limiting version, CorollarylU and the finite 
girth correction. Theorem [5l Interestingly, by considering the upper and lower bounds in Theorem [5] 
and taking a double limit r,g ^ cx), we find that the GREEDY algorithm produces a nearly perfect 
matching in the double limit r,g ^>- cx). This partially answers an open problem posed by Frieze |Fri] 
regarding the construction of a simple, decentralized algorithm for finding a nearly perfect matching in 
constant degree regular graphs with large girth. 

Our second set of results. Theorems [3] and [6l concerns the variance of the weight (cardinality) of the 
independent set and matching produced by GREEDY in arbitrary graphs with bounded degree. That 
is no additional assumptions on girth or regularity are adopted. We show that when the weights are i.i.d. 
and have finite second moment, and when the graph has bounded degree, the variance, appropriately 
normalized, is of the order 0(l/n) in both cases. We are also able to give explicit bounds in terms 
of the graph degree, the number of nodes, and the second moment of the weighting distribution. We 
also give similar results for the unweighted case. Thus the answers produced by GREEDY are highly 
concentrated around their means, and in this sense the GREEDY algorithm is very robust. We believe 
these are the first results on the variance of the GREEDY algorithm. 

We now review some additional relevant literature. The AIIS, MWIS, MM and MWM problems 
are obviously well-studied and central to the field of combinatorial optimization. The MIS problem is 
known to be NP-Complete, even for the case of cubic planar graphs |GJS76] and graphs of polynomially 
large girth [Mur92j . and is known to be MAX — SNP complete even when restricted to graphs with 
degree at most 3 |BF94] . From both an approximation algorithm and existential standpoint, the MIS 
problem has been well-studied for bounded degree graphs |IIR94] . [HR97J, |BF94| : graphs with large 
girth |MS85| . |Mur92] : triangle-free graphs with a given degree sequence |AKS80| . |AEKS81| . [GriSSj . 
She83| . [She91]: and large-girth graphs with a given degree sequence, including regular graphs with 



large girth jBolSO] . [HS82] . [She91] . |Den94j . |She95] . [LW] . We note that, as already mentioned, our 
Corollary [3] was derived earlier in [LW] using different techniques. 

Although the MM problem is solvable in polynomial time, much research has gone into finding 
specialized algorithms for restricted families of graphs. The most relevant graph families for which 
MM has been studied (often using GREEDY and related algorithms) are bounded-degree graphs, 
and bounded-degree graphs of girth at least 5 |DF91| . [MP97j . However, there appears to be a gap 
in the literature for MM in regular graphs with large girth, barring a recent existential result that 
an r-regular graph with large girth g always contains a matching of size ^ — 0{{r — l)~2n) }FII07] . 
Namely, an asymptotically perfect matching exists in such graphs as the girth increases. It is of interest, 
however, to construct some decentralized and easy to implement algorithm for MM which leads to an 
asymptotically perfect matching, and our result Theorem [5] is a step towards this direction. 

Our main method of proof uses the correlation decay technique, sometimes also called the local 
weak convergence (objective) method [AldOlj . [AS03J . [GNS] . We establish that the choices made by the 
GREEDY algorithm are asymptotically independent for pairs of nodes (in the case of independent sets) 
and edges (in the case of matchings) which are far apart. That is, if two nodes i,j are at a large graph- 
theoretic distance, then P(i,j € IG) ~ P(i £ IQ)¥{j £ TQ). A similar statement holds for matchings, 
and also for the weighted case with i.i.d. weights. This allows the reduction of the problem on a graph 



to the far simpler problem formulated on a regular tree, which can be solved in a very explicit way. 
Such an asymptotic independence was also observed in |LW| . but here we are able to characterize this 
decay in a more explicit manner. A similar phenomenon was also observed in |GNSj . which studied 
maximum weight independent sets and matchings for the case of i.i.d weights in r-regular graphs with 
girth diverging to infinity. There it was observed that for the case of i.i.d. exponentially distributed 
weights, such a decay of correlations occurs when r = 3, 4 and does not occur when r > 5, even as 
the girth diverges to infinity. Thus the techniques of |GNSj were only able to analyze exponentially 
weighted independent sets in regular graphs of large girth when the degree was r < 4. In contrast we 
show that independent sets produced by GREEDY always exhibits such a decay of correlations for any 
degree. This allows us to extend the analysis of [GNS] to regular graphs of arbitrary constant degree. 
In Section [6] we will see that GREEDY is nearly optimal for the settings considered in [GNSj . 

We now give an outline of the rest of the paper. In Section [2] we state our main results formally 
and show that our analysis for the case of i.i.d. weights encompasses the analysis for the unweighted 
case. In Section [3] we introduce the notion of an influence blocking subgraph, show that under an i.i.d. 
weighting most nodes (edges) will belong to such subgraphs, and show that these subgraphs determine 
the behavior of GREEDY. This enables us to prove certain locality properties of GREEDY, which 
we then apply to the setting of regular graphs of large constant girth. In Section |4] we introduce and 
study a bonus recursion that we will use to analyze the performance of GREEDY on infinite r-ary 
trees. Section [5] is devoted to proving results on the variance of GREEDY. In Section [6] we numerically 
evaluate our bounds and compare to earlier bounds in the literature. Finally, in Section [7] we provide 
directions for future work and summary remarks. 

1.3 Notations and conventions 

We close this section with some additional notations. Throughout the paper we consider simple undi- 
rected graphs G = {V,E). Given a simple path P in a graph G, the length of P is the number of 
edges in P. Given two nodes i,j € V, the distance D{i,j) is the length of a shortest i to j path in G. 
Similarly, the distance D{ei,e2) between two edges ei,e2 £ E is the length of the shortest path in G 
that contains both ei and 62, minus one. Given a node i aV, let the depth— d neighborhood Nd{i) be 
the subgraph rooted at i induced by the set of nodes i' with D{i,i') < d. Givn an edge e, let Nii{e) 
denote the subgraph induced by the set of edges e' with D{e, e') < d. Specifically, for every node i and 
edge e, Nq{i) = {i} and iVo(e) = {e}. For simplicity we write A^(-) for A''i(-). |A^(i)| is the degree of the 
node i, and maxjgy |-^(«)| is defined to be the degree of the graph. 

Given a rooted tree T, the depth of T is the maximum distance between the root r and any leaf, 
and the depth of a node i in T is D{r, i). Given a node i G T, the set of children of i is denoted by G{i). 

Suppose the nodes of an undirected graph G are equipped with weights Wi. We say that a path 
^1, ^2, ■ ■ ■ ) ^fc is node increasing if Wi^ < ■ ■ ■ < Wi^ . Similarly, if the edges of G are weighted Wij, we say 
that a path ii, ^2, • • ■ ,ik is edge increasing if Wi^i^ < ■ ■ ■ < Wi^_^i^. 

Denote by T{r,d),d > 1 a depth-d tree where all non-leaf nodes have r children, and all leaves are 
distance d from the root. Denote by T{r + l,r,d),d > 1 the depth-d tree where the root has r + 1 
children, all other non-leaf nodes have r children, and all leaves are distance d from the root. Note that 
if G € G{g, r) for some 5 > 4, then for every node i E y{G) and any d < [^^J , N(i{i) is (isomorphic 
to) T{r, r — 1, d). By convention, T{r, 0) and T{r + 1, r, 0) both refer to a single node. 

Throughout the paper we will only consider non-negative distribution functions, so the non-negativity 
qualification will be implicit. If X is a discrete r.v. taking values in Z_(_, the corresponding probability 
generating function (p.g.f.) is denoted by (j)x{s) = Y^^=q^^'^{-^ = k). If two r.v. X and Y are equal 
in distribution, we write X = Y . When X is distributed according to distribution F, we will also write 



(with some abuse of notation) X = F. The m-fold convolution of a random variable X is denoted by 
X'™). Let W he a continuous r.v. and let X be a r.v. taking non-negative integer values. Denote by 
W^'^^ the r.v. maxi<j<x Wi when X > and when X = 0. Here Wi are i.i.d. copies of W. Given 
two events A,B, let AaB and Ay B denote, respectively, the conjunction and disjunction events. Also 
A"^ denotes the complement of the event A and I{A) denotes the indicator function for the event A. 

2 Main results 

2.1 Weighted case 

The following is our main result for the performance of the GREEDY algorithm for finding largest 
weighted independent sets. Both in the context of independent sets and matchings we assume that the 
weights (of the nodes and edges) are generated i.i.d. from a non-negative continuous distribution F. 

Theorem 1. For every 5 > 4 and r > 3, and every continuous non-negative r.v. W = F with density 
f and E[W] < 00, 



x{r-l-(r-2)F(x)] "'^ f(x)dx -mw f^^ „ ' 



< inf E 

GGG(9,r) 



w[ig] 
\v\ 



< sup E 

GeG(g,r) 



w[ig] 

\V\ 



<l x[r-l-{r-2)F{x)) f{x)dx + E[Wl 



f (r — 1) L 2 J 



As an immediate corollary, we obtain the following result. 
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Corollary 1. For every r > 3 and every continuous non-negative r.v. W = F with density f and 

E[W] < 00, 



lim inf E 

g-^oo GeG{g,r) 



w[ig] 

\V\ 



lim sup E 

9^°^GeG(g,r) 



w[ig] 



V 



x(r-l-{r- 2)F(x) ) ' ' f{x)dx. 



(1) 



We now present the results for matchings. 
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Theorem 2. For every g > 4 and r > 3, and every continuous non-negative r.v. W = F with density 
f and E[W] < 00, 



X I r — 1 — (r — 2)F(x] 



"■"' fix)dx-E[W]-^ ' 



(L^J)! 



< inf E 
GGGig,r) 



w[Mg] 
\v\ 



< sup E 

GeG{g,r) 



w[Mg] 
\v\ 



r 
< - 
- 2 



2(r-l) , -.s|_2^J 

:^i^r-l-{r-2)Fix)) ' '-' fix)dx + E[w f^''~^' 



An immediate implication is 
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Corollary 2. For every r > 3 and every continuous non-negative r.v. W = F with density f and 

E[W] < oo, 



lini inf E 
g^oo GeG{g,r) 



wjMg] 

\V\ 



lim sup E 



w[Mg] 



V 



x(r-l- (r-2)F(.x) 



2(r-l) 
r-2 



f{x)dx. 



(2) 



We now state our main results on bounding the variance of W[IQ] and W[Aig]. 
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Theorem 3. For every continuous non-negative r.v. W = F with E,[W'^] < oo, and for every graph G 
with degree r > 3, 



^, ,W\Tg], 9E[iy2ir2e('-i)' 
Var\ — TT^. — I < 



\V\ 



\V\ 



and 



Vary — - — ■. — ^1 < 



1^1 



1^1 



(3) 



(4) 



We stress that, unlike previous results, no assumption is made on the structure of the graph other 
than a bound on the maximum degree. 

2.2 Unweighted case 

As we will show in the following subsections. Theorems [T] and [5] lead to the following bounds on the 
cardinality of independent sets and matchings produced by GREEDY in regular unweighted graphs. 

Theorem 4. For every 5 > 4 and r >2>, 

2 1 g — 2 I 

1 - (r - 1) ---a r(r -1)L~2-J 



-, < inf E[^] < sup E[M] 

2 (L^J+1)! ~GeGM ^ \V\^ - G^G{a,r) M^l 

2 1 9-2 

1 - (r - 1) --a r(r-l)L^~J 

- 2 ^ (L^J + 1)!' 

The following immediate corollary is an analogue of Corollary [1] for the unweighted case. 
Corollary 3. For every r > 3, 



(5) 



lim inf E 
g^ocGeG(g,r) 



\ig\ 
\v\ 



lim sup E 



m 



l-{r-iy 



2 

'r-2 



A second corollary is the following lower bound on the size of a maximum independent set in an 
r-regular graph with girth > g. 



Corollary 4. For every 5 > 4 and r > 3, 



\I\ 1 — (r — 1) "--a rir — 1) 

inf f4 > ^ 

GGG(g,r) \V\ 2 



L^J 



(L^J+1)! 



(6) 



Our results for matchings are as follows. 
Theorem 5. For every g > 4 and r > 3, 



!-(.-!)-- .(r-l)W^ .^^ ^^IMGl^^ ^^^ ^^IMGl 



(L^J)! ~GeG(g,r) ' \V\ 



G€G{9,r) 



V 



1 - (r - 1) --a r(r - 1)L''2 J 



Corollary 5. For every r > 3, 



lim inf E 

g^oo GeG(3,r) 



\Mg\ 

\V\ 



(L^J)! 



lim sup E 

9-*°° GeG{g,r) 
1 - (r - 1)"^ 



\Mg\ 



V 



(7) 



As a result 



lim lim inf E 

r-^oo g-^oc GeG{g,r) 



\Mg\ 
\v\ 



lim lim sup E 
1 

2' 



\Mg\ 



V 



Namely, GREEDY finds a nearly perfect matching when both the degree and girth are large. A 
second corollary is the following lower bound on the size of a maximum matching in an r-regular graph 
with girth > g. 



Corollary 6. For every g > 4: and r > 3, 



\M\ 1 - (r - 1) '■-2 r(r -1)I-S-J 
inf > 



GGG(a,r) \V\ 



(L^J)! 



(8) 



Bounds on the variance of W[ig],W[Aig] will result in the following bounds for the variance of 

\igi\Mg\. 



Theorem 6. For every graph G with degree r >3, 



and 






^""f^^ - \E\ 



(9) 



(10) 



2.3 Converting the weighted case to the unweighted case 

In this section, we prove that all of the results pertaining to GREEDY^s performance w.r.t. finding 
unweighted independent sets and matchings are implied by our analysis for the case of i.i.d. weights. 
This will allow us to focus only on the case of i.i.d. weights for the remainder of the paper. 



Lemma 1. Theorem[l\ implies Theorem^ and Theorem\^ implies Theorem\^ 



Proof. We first prove that Theorem [T] implies Theorem HI Fix e > 0. Let F be a uniform distribution 
on [1 — e, 1 + e]. Applying Theorem [1] we have 

x((r-l)- (r-2) ^^ ^-) ^^—dx ^ ' 



l-e 



< inf E 



W\LG\ 

\y\ 



2e 

< sup E 

GgG{g,r) 



26 (L^J + 1)! 



1+^ x-(l-e),_^_l r(r -1)LVJ 



< / x((r - 1) - (r - 2) 



2e 



-2 — dx + 



2e (L^J+1)!' 



Note that (1 - e)E[|J^|] < E[Ty[2:^]] < (1 + e)E[|Ja|], and for 1 - e < x < 1 + e we have ((r - 1) 



«;Mi^^-7^^>0. Thus 



(1 - e) r\{r - 1) - (r - 2 ^ ' ^' ' 'h '^^dx '^' " '^^'^ ^ 



l-e 



< (1 + e) inf E 

G(iG{g,r) 



2e 



26 (L^J+1)! 



< (1 + 6) sup E 
GGG(g,r) 



1^1 



< -^ — (r-l)-(r-2 ^ '-) '-2— dx + ^ — TT^ 

Letting u = ^^^ ''' , we can apply integration by substitution to find that: 



r[r 



(1 - 6) j^ ((r - 1) - (r - 2)ur—^du - ^ ^ ^^ 



< (1 + 6) inf E 

GGG(s,r) 



\V\ 



< (1 + e) sup E 

G&G(g,r) 



\TQ\ 
\V\ 



< (llil! /"((._ 1) _(._ 2),,)- A,„ + i±i!l!:^i)i±i. 

- l-e Jo l-'d'rJ + l)! 

Evaluating the integrals and letting e — > then demonstrates the desired result. The proof that 
Theorem [2] implies Theorem H] follows identically, using the bounds for matchings instead of those for 
independent sets. D 

Lemma 2. Theorem\^ implies Theorem\^ 

Proof. We first prove that ([3D implies (0). Let again F be a uniform distribution on [1 — e, 1 + 6]. The 
following bounds are immediate 



\V\ 



< Var[ 



\V\ 



W[1Q] 
\V\ ^ 



< (1 + g)2]E[(M)2] _ (1 _ e)2E2[ffl], 



\V\ 



\V\ 



which impHes that 



|yar[M] - Var[^]\ < (2e + .^)(E[(^)^] + E^[M]). 



.2 



Thus since the second moment of F is 1 + ^ , by the triangle inequahty and Theorem [3] we find that for 
any graph G of maximum degree r, 

Observing that |wr| < 1, we see that ([9]) follows by letting e — > 0. 

The proof of (llOp from ([4]) is done similarly. D 



3 Influence blocking subgraphs 

In this section we introduce the notion of an influence blocking subgraph, and give a useful characteri- 
zation of these subgraphs. We then bound the probability that a node (edge) of a bounded degree graph 
G is contained in (an appropriately) small influence blocking subgraph under an i.i.d. weighting from 
any continuous distribution function. Throughout this section we consider a graph whose nodes and 
edges are equipped with non- negative distinct (non-random unless otherwise stated) weights Wi,i £ V 
and We,e G E. 

Definition 1. A subgraph H of G is called an influence blocking subgraph (i.b.s.) if for every node 
(edge) z £ H,Wz> ui&-Ky(,N {z)\H Wy. 

Here N{z) \ H means the set of nodes or edges (depending on the context) in N{z) which do not 
belong to H. We now show that for any set of nodes (edges) Z there exists a unique minimal i.b.s. H 
containing Z, and give a simple characterization of this subgraph. 

Lemma 3. Given a set of nodes (edges) Z there exists a unique minimal i.b.s. H containing Z . Namely, 
for every other i.b.s. H' containing Z , H is a subgraph of H' . Moreover, H is characterized as the set 
of nodes (edges) z such that there exists a node (edge) increasing path zi, . . . ,Zk with z\£ Z and Zf^ = z. 

We denote this unique minimal i.b.s. by IBg{Z), or IB{Z) when the underlying graph G is unam- 
biguous. 

Proof. We first show that IBg{Z) is contained in every i.b.s. T containing Z. Suppose, for the purposes 
of contradiction, there exists an increasing path zi,Z2, ■ ■ ■ ,Zk such that zi £ Z,Zk ^ T. Let I < khe the 
largest index such that zi E T. Then zij^i € N{zi), but Wzi_^^ > Wzi, which is a contradiction to the 
fact that T is an i.b.s. 

We now show that IB{Z) is itself an i.b.s. containing Z. By definition Z C IB{Z). Now let 
z € IB{Z) be arbitrary and let z' £ N{z) \ IB{Z) be arbitrary as well. If Wz' > Wz, then since 
there exists an increasing path from Z to z, by appending z' to this path we obtain an increasing path 
from Z to z' and thus z' G IB{Z), which is a contradiction. We conclude W^' < Wz, and the proof is 
complete. D 

We now show that the existence of a 'small' i.b.s. for N{v){ N{e) ) is independent of IVj,(iye) under 
an i.i.d. weighting. 



Lemma 4. Given an arbitrary node (edge) z, IB{N(z)) C Nci{z) holds iff there does not exist a node 
(edge) increasing path between some node (edge) z' € N{z) and z" G N(i+i{z)\N(i{z) which is contained 
entirely in G\z. As a result, if the node (edge) weights of G are generated i.i.d. from a continuous 
distribution F , then the event IB{N{z)) C Nd{z) is independent from Wz- 

Proof. If there exists an increasing path zi, . . . ,Zfc between N{z) and N(i+i{z) \ Nd{z), then the last 
element Zk G N^^i{z) \N^{z) must belong to IB{N{z)) and thus IB{N{z)) C N^{z) cannot hold. Now 
suppose no increasing path exists between N{z) and Nd+i{z) \ Nci{z) inside G\z. Then no increasing 
path can exist between N{z) and iVrf+i(z) \ Nd{z) inside G either, since in any such path we can find 
a subpath which does not use z. This completes the proof of the first part of the lemma. The second 
part is an immediate implication. D 

The usefulness of the i.b.s. comes from the following lemma, which informally states that the 
decisions taken by GREEDY inside an i.b.s. H are not affected by the complement of H in G. 

Lemma 5. Suppose H is an i.b.s. ofG. Then IG (G) (1 V (H) =ig{H) (Mg{G)nE{H) = Mg{H)), 
where the weights of H are induced from G. 

Proof. Let zi,Z2t ■ ■ ^Zm be the nodes (edges) of H ordered in decreasing order by their weight. We 
show by induction in /c = 1,2, ...,m that Zk G ig{G){zk G Mg{G)) iff Zk G ig{H){zk G Mg{H)). 
For the base case k = 1 observe that zi is the heaviest element of H. Since H is an i.b.s. then also zi 
cannot have a heavier neighbor in G\H. Thus GREEDY will select it both for G and H. 

We now prove the induction step and assume the assertion holds for all k' < k — 1 < m. Suppose 
Zk was not accepted by GREEDY when it was operating on G. This means that GREEDY accepted 
some neighbor of Zk which was heavier than Zk and, as a result, deleted Zk- Since H is an i.b.s. this 
neighbor must be in H, namely it is z^' for some k' < k. By the inductive assumption GREEDY 
selected Zk' when it was operating on H as well. Then all neighbors of z'j^ in H are deleted including Zk, 
and thus z^ cannot be accepted by GREEDY when operating on H. Similarly, suppose GREEDY did 
not select Zk when it was operating on H. Namely, GREEDY accepted some neighbor Zk' of z^ with 
k' < k. By the inductive assumption the same holds for GREEDY operating on G: z^i was accepted 
and all neighbors, including Zk were deleted. This completes the proof of the induction step. D 

We now bound the probability that IB{N{z)) is contained in N(i{z) when z is a node (edge) in a 
bounded degree graph G and the weights are random. 

Lemma 6. Let G be any graph of maximum degree r >2>, and suppose that the nodes and edges of G 
are equipped with i.i.d. weights from a continuous distribution F. Then for any node (edge) i(e) and 
any d > 0, 



P(/S(iV(i)) c Nd{v)) > 1 
F{IB{N{e)) C Ndie)) > 1 



r{r — 1) 
2(r - lf+^ 



{d + iy. ' 

where the first (second) inequality is understood in the context of node (edge) weights. 

Proof. Any length-/c path equipped with i.i.d. node (edge) weights generated using a continuous distri- 
bution is a node (edge) increasing path with probability equal to 1/(^+1)! ( l/k\ ). For every node z G G 
there exist at most r{r — 1)'^ distinct length— d paths in G \ z that originate on some node in N{z) \ z 
and use exactly one node from N{z). For every edge z £ G, there exist at most 2(r — l)'^^"^ distinct 
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length— (d+l) paths in G\z that originate on some edge in N{z)\z and use exactly one edge from N{z). 
Observe that every node increasing path originating in N{z) \ z and terminating in Nci+i{z) \ A^rf(z) 
must contain a length— d node increasing subpath originating in N{z) \ z which uses exactly one node 
of N{z). We then obtain the result by applying a simple union bound and Lemma |4l D 

We now state and prove the main result of this section. 

Theorem 7. Let G € G{g,r) for some g > 4: , and d > L^^J ^^ arbitrary. Let T = T{r,r — l,d) 
have root 0. Suppose the nodes and edges of G and T are equipped with i.i.d. weights from a continuous 
distribution F . Then for every node i G V{G), edge e G E{G), and every child j of in T 

E[WiL{ieig{G))]-E[WoL{Oeig{T))] <E[w f^J'~^} ' (11) 



and 



(LVJ + 1)! 



E[WJie G MGiG))] - E[Wojimj) G Mg{T))] < E[W] \,,J,^, , (12) 



(L^J)! 



where W = F. Also the limits 



lim P(0 G ig{T)), lim P((0, j) G Mg{T)) (13) 

d— >oo d— >oo 



exist. 



Remark : It is important to note that the bounds of this theorem hold for any value of d > [^^J • 
It is this property which will ultimately lead to the existence of limits (|13|) , as we will see shortly in the 
proof. Later on the existence of these limits will lead to a simple expression for the limiting value of 

E[W,L{i G ig{T))] and E[WJ{e G Mg{T))]. 

Proof Denote LB{N{i)) with respect to G by H{i) and IB{N{0)) with respect to T by H{0) for 
simplicity. Let do = L^^J ^ '^- Then N^^^i) is a T{r,r — l,do) tree. We can construct a coupling in 
which T = T{r,r — l,d) is the natural extension of this tree with additional node weights generated 
independently from the node weights of G. In this setting the node i takes the role of the root of T. 
We have 

WJii G igiG)) = WJii G ig{G),H{i) C Nd„{i)) + WJii G ig{G),H{i) ^ Nd,{i)) 
= WoIiOeigiT),H{0) C Nd,m + Wa{ieigiG),H{i) ^ Nd,{i)), 

where the second equality follows from Lemma [5l This sum is upper bounded by 

< WoIiO G ig{T)) + WiI{H{i) (t Nd,ii)). 

It follows that 

E[WiI{i G igiG)) - WoI{0 G ig{T))] < EmL{Hii) ^ Naji))] 

= E[W]¥{H{i)<^Nd,{i)) 

r(r — 1 )'^o 
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where the equahty follows from the second part of Lemma U] and the last inequality follows from 
Lemma [6l We complete the proof of the bound ()lip by establishing a similar bound with the roles of 
Wil{i G ig{G)) and Wo/(0 € ig{T)) reversed. 

We now establish the last part of the theorem, namely the existence of limits (J13p . Consider any 
d' > d. Let T' = T{r, r — 1, d') be a natural extension of the tree T with the same root 0. Namely, the 
additional nodes of T' are weighted i.i.d. using F, independently from the weights of the nodes already 
in T. Let H' denote IB{N{0)) with respect to T'. We have 

1(0 G ig{T)) = /(O E ig{T),H' C T) + /(O G ig{T),H' ^ T) 
= 7(0 G Jg(r'), H' CT)+ 7(0 G ig{T),H' f_ T) 
<i{oeig{T')) + i{H' <^T). 

This implies that 

P(0 G Ta(T)) - P(0 G ig(T')) < F{H' ^ T) 

<r(r- 1)^^/(^+1)!, 

where the last inequality follows from Lemma [6l By reversing the roles of T and T' we obtain 

P(0 G ig{T')) - P(0 G ig{T)) < r{r - l)'^/{d + 1)!. 

We conclude that the sequence P(0 G ig{T{r, r — 1, d))),d > 1 is Cauchy and therefore has a limit. This 
concludes the proof for the case of independent sets. The proof for the case of matchings is obtained 
similarly and is omitted. D 

4 Bonus, bonus recursion and proofs of the main results 

4.1 Bonus and bonus recursion 

In this subsection, we introduce the notion of a bonus for independent sets and matchings on trees. 
Consider a tree T with root 0, whose nodes (edges) are equipped with distinct positive weights Wi,i G 
T{Wij,{i,j)GT). 



Definition 2. For every node i ^T let 
MS{i) - 



Wi if i is a leaf; 

WiKWi > maxjgc'(j) S{j)) otherwise; 

if i is a leaf; 

maXj£(7(j)(VFjj I{Wij > MS{j))) otherwise; 



The quantities S{i),MS{i) are called the bonus of i in the rooted tree T and will be used for the 
analysis of independent sets and matchings respectively. Let Tj be the subtree of T rooted at i. Note 
that the bonus of i depends only on the subtree Tj. To avoid ambiguity, for a subtree 77 of T rooted at 
i we let MSnii) denote the bonus of i computed w.r.t. the subtree 77. We now prove that S{i){MS{i)) 
determines whether the root belongs to ig{T){Aig{T)). 

Proposition 1. Given a weighted rooted tree T with distinct positive weights on the nodes and edges, 
for every node i and edge {i,j), 
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1. [Independent sets]. S{i) = Wil{i € XQ{Ti)). Specifically, for the root we obtain S{fd) = WqI{Q G 
1G{T)). 

2. [Matchings]. MS{i) = uiaXj^(jr^\WijI{{i,j) € MQ(Ti)). Specifically, for the root we obtain 
MS{0) = max,ec(0) Wo^IdOJ) G Mg{T)). 

3. [Matchings]. For every j G C{i), {i,j) G Mg{Ti) iff Wij > max{M S (j) , M S h (i)) , where H is 
the subgraph of Ti obtained by deleting {i,j) ^Tj. 

Proof. Let d be the depth of T. We first prove part 1. The proof proceeds by induction on the depth 
of a node, starting from nodes at depth d. Thus for the base case, suppose i belongs to level d of T, 
and, as a result, it is a leaf. Then S{i) = Wi. On the other hand, Tj = {i} and I{i G IG{Ti)) = 1, and 
the claim follows. 

For the induction part assume that the hypothesis is true for all nodes at depth > k + 1 for k < d—1. 
Let i be some node at depth k. Observe that GREEDY selects node i for inclusion in IQ{Ti) iff i 
is not adjacent to any nodes in Tj that are selected by GREEDY prior to node i being examined 
by GREEDY . The set of nodes in Tj examined by GREEDY before i are those nodes j such that 
Wj > Wi. Thus the event i G ig{Ti) occurs iff for ah j G G{i) s.t. Wj > Wi, we have j ^ lG{Ti). We 
claim that for each such j, j ^ ZQ{T.i) iff j ^ ZQiTj). Indeed, the event j ^ XQ{Ti) is determined by a 
subgraph H of Tj induced by nodes with weights at least Wj. Therefore this subgraph does not include 
i if Wj > Wi. It follows that H D Tj is disconnected from the rest of H and then the claim follows. 

We conclude that i G XQ{Ti) iff for each j G C{i) either Wi > Wj, or j ^ ZQiTj). Combining, i G 
J^(Tj) iff Wi > maxjgc'(i) Wjl{j G TG{Tj)), but by the inductive hypothesis, Wjl{j G TQ{Tj)) = S{j). 
Therefore, i G ZQ{Ti) iff Wi > maXj^QU) S{j) and the inductive assertion follows. 

We now prove part 2. The proof is again by induction on the depth of a node. Base case: i is at 
lowest depth d and thus a leaf. In this case, G{i) = 0, and thus maxjgc'(j) WijKWij G M.Q{Tj)) = 
MS(i) = 0. For the induction step assume that the induction hypothesis is true for all nodes at depth 
> k + l,k < d — 1. Let i be some node at depth k. If i is a leaf we use the same argument as for the base 
case. Thus assume i is not a leaf. Suppose (i, j'l) G AiQ{Ti). We claim that then Wij^ > MS{ji). Indeed 
Observe that GREEDY selects (i,ji) for inclusion in A4Q{Ti) iff {i,ji) is not itself adjacent to any 
edges in Ti that are selected by GREEDY prior to {i,ji) being examined by GREEDY. Thus the event 
(i,ji) G MG{Ti) implies that for all I G C(ji) s.t. Wj^^i > Wij^, we have {ji,l) ^ MQ{Ti). Repeating 
the argument used for the case of independent sets, we claim that the event (j'l, I) ^ MQ{Ti) occurs iff 
the event (j'l, I) ^ M.Q{Tj-^) occurs. Therefore, the event (i, j'l) G A^^(Tj) implies that for each / G G{ji) 
either Wij^ > Wj^^i or {ji,l) ^ MQ[Tj^), namely the event Wij^ > maxK^dj^) Wj^,iHUi,l) ^ MGiTj^)) 
occurs, which by induction hypothesis is equivalent to the event Wij-^ > MS{ji), as claimed. 

We now complete the proof of the induction step. First assume that Wij < MS{j) for all j G G{i). 
Then from the preceding claim we obtain that no edge {i,j) belongs to A4Q{Ti) and the claim is 
established. Otherwise, let ji G G{i) be such that Wij^ is the largest weight among edges Wij,j G G{i) 
satisfying Wij > MS{j). By the choice of ji and the preceding claim it follows that if Wiji > Wij^, 
then {i,j') ^ A^^(Tj). Thus it remains to show that {i,ji) G A^^(Tj). The GREEDY examines («,ji) 
after edges {i,j) with Wij > Wij-^, but before edges (i, j) with Wij < Wij^. Since edges with Wij > Wij^ 
were rejected, then whether (i, j'l) is accepted is determined completely by (i, j'l) plus the subtree T{ji). 
Repeating the argument above, we see that {i,ji) is accepted iff Wij^ > inaxi^c{ji)^ji,i^{iJiT^) ^ 
MQ{Tj^)), which, by the inductive hypothesis occurs iff Wij^ > MS{ji), which is satisfied by the choice 
of ii- 

To prove part 3, we repeat the arguments used to prove parts 1 and 2 to observe that the GREEDY 
selects {i,j) iff for all neighbors / of j in T{j) with Wj^i > Wij, the edge (j, I) is rejected by GREEDY 

13 



in T{j), and for all neighbors I y^ j of i in T{i) \ {{i,j) UT(j)), with Wa > Wij, the edge (i, /) is rejected 
hy GREEDY mT{i)\{{i,j)UT{j)). D 

4.2 Distributional recursion for bonuses 

We now introduce two sequences of recursively defined random variables {X^^r}, d > 0, and {Y^^r}, d > 
for any given integer r > 2. These sequences will play a key role in understanding the probability 
distribution of the bonuses introduced in the previous subsection. 

Given a positive integer k, let B{k) denote a Bernoulli random variable with ¥{B(k) = 1) = 1/k. 
Define 




d = 0; 



{X^> + 1)B{X^> +1) d>l; 



(14) 



d = 0; 

\{r) (15) 

(yd_i,, + l)S(Fd_i,, + 1) d>l; 



For an integer-valued r.v. Z > 1, the joint probability distribution of Z, B{Z) is assumed to be 
P(Z = z,B = l) = {l/z)W{Z = z). 

It is immediate from these recursions that for all d > 1 

E[Xd,,] = l, E[yd,,]=r. (16) 

In the following lemma we show that the distribution of the bonuses S and MS on regular trees 
have a very simple representation in terms of the constructed sequences {Xd^r}-, \Xd,r}- 

Lemma 7. Suppose the nodes and edges of a tree T{r,d) with root are equipped with i.i.d. weights 
generated according to a continuous distribution F. Then S'(O) = W'^^'^'^^ , and MS{0) = VI/<^d.'->^ 
where W = F. 

Proof. We first prove the identity for S'(O). The proof proceeds by induction on d. For the base case, 

suppose d = 0. Then S{0) = W and the conclusion trivially holds. For the induction step, assume the 
hypothesis is true for all d' < d. Let Tj denote the depth d — 1 subtree of T{r, d) rooted at the j-th child 
of 0. By the inductive hypothesis we have that S{j) is distributed as W '^-^•'^^ . This implies that 
maxjg(7(j) S{j) is distributed as (W^-^'^-^^^^)'^'^^ . Since Wq is drawn independent oiva.&yij^(j{i) 'S'(j), we 

have that S'(O) = WqI{Wq > (VF<"^d-i.'-> )<''>). The event underlying /(•) means that Wq is the largest 

D (r) 

among K + 1 random variables distributed according to F, where K = X^-j^^. The required identity 
then follows from the definition of X^^j.- 

We now establish the identity for MS{0) using induction in d. For the base case d = we have 
MS{0) = and the conclusion trivially holds. For the induction case, assume that the hypothesis is 
true for all d' < d. Let again Tj denote the depth d — 1 subtree of T(r, d) rooted at the j'-th child 
of 0. By the inductive hypothesis MS{j) is distributed as W '^^^•''^ . Since Wqj is drawn i.i.d., we 

have MS{Q) = maXj(,c(o)WojI{Woj > VF<^d-i,'->), where WojI{Woj > M/<'^d-i.'->) is independent for 
each j. Note that for each j, WojI{Woj > W '^^^'''^) is by definition distributed as the maximum 
of (ld-i,r + l)B{Yd-i^r + 1) i-i-d. realizations of W. Thus ma^j^c{o) WojI{Woj > M^<^d-i.'->) is 
distributed as the maximum of r independent samples of the maximum of (1^-1^,. + l)B(Yd-i^r + 1) 
i.i.d. realizations of W, which by the basic properties of maxima is distributed as the maximum of 
{{Yd-i^r + l)B{Yd^i^r + l))'*"-* i.i.d. realizations of W, from which the lemma follows. D 
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Recall that <j)x denotes the probability generating function for a discrete r.v. X. 

Lemma 8. Suppose the nodes and edges of a tree T = T(r, r — l,d) with root are equipped with i.i.d. 
weights generated from a continuous distribution F . Then 

and for every j E N{0) 

E[Wo,I{{0,j) G Mg{T))]=K[WcPY,_,,^.,+Y,,^.AF{W))], 

where W is distributed according to F, and random variables W,Xfi-i^r-i-,Yd-i,r-i-,Yd^r-i o^'e indepen- 
dent. 

Proof. We first prove the result for independent sets. By Proposition [1] and the definition of ^(O), 
E[Tyo/(0 G TG{T))] = E[WqI{Wo > maxjgc{0) S{j))]. By LemmaEl for each j G C(0), 

S{j) = 14/<"''^d-i,r-i>_ ii then follows that m.aXj^(jfQ\ S{j) = W d-i,r-i ^ which is independent from 
Wq. Thus we have 

E[WoIiOeigiT))] = E[E[WoI{OeigiT))\Wo]] 

oo 

= E[Wo Y^{F{Wo))'nx^J\,_^ = k)] 

k=0 

= E[Wo(j) ir) {F{Wo))] 

^d-l,r-l 

We now prove the result for matchings. From the third part of Proposition [T] we have 

E[Wojimj) G ig{T))] = E[WojI{Woj > max(M5j^(0),M5T,(j)))], 

where H is the subgraph of T obtained by deleting (0, j) and Tj - the subtree of T rooted at j. Observe 
that ff is an r — 1 regular tree with depth d, namely it is T(r — 1, d), and Tj is an r — 1 regular tree with 

depth d-1. Thus applying LemmaEl MSh{0) = W<^'i'—^> and MSr^iJ) = W<^'i~^--~^> . Repeating 

(r) 

the line of argument used for independent sets, replacing X^_^,^_^ with ld-i,r-i + ^d,r-i, we obtain 
the result. D 

4.3 Limiting distribution of X^^r and Y^^r 

In this subsection we show that the sequences {X^; ,,}; d >0, and {Y^ ,,}, d > converge in distribution to 
some limiting random variables, by exploiting their recursive definitions. We then use this convergence 
along with Lemma [8] to express the quantities of interest in terms of the p.g.f. of these limiting random 
variables. 

Lemma 9. There exist r.v. Xoo.D^^.r such that for all k > 0, lim^^oo JP(^d,r = k) = P(Xoo,r = k) 
and lim^^oo IP(>d,r = k) = P(yoo,r = A:)'. 
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Proof. We begin by establishing the existence of the hmit Unirf^oo 'PiXd,r = k) for A; = 0. The case of 
k > 1 will be established by induction. Consider T = T{r, d) with root whose nodes are weighted 
i.i.d. with an arbitrary continuous distribution F. Prom Proposition [H part 1, we have that S'(O) = 
iff ^ IQ(T). Therefore by Lemma [7] 

P(5(0) = 0) = P(0 i ig{T)) = ¥{Xd,r = 0). 

But the last quantity has a limit as d ^f oo as asserted by the last part of Theorem [71 
Assume now that the limits exist for all A;' < A; — 1. We have 

nxd,r = k) = lnx^;l,,. = k-i) = ^ ^ H ^(^^-i,- = ^o, 

(fci,fe2,...,A;,.)l<J<r 

where the sum is over all partitions (fci, k2, ■■■, k^) with ki > 0, X]i<j<r ki = k — 1. Since ki < k — 1 for 
each i, by the inductive assumption the limits limrf_>oo P(^d-i,r = ki) exist. The same assertion then 
follows for ¥{Xd^r = k) and the proof is complete. 

Define Xoo,r by P(Xoo,r = k) = lim^^oo P(-'^d,r = k). We need to show that X^fcPl-'^oo.r = A;) = 1. 
Pix e > and K > 1/e. Applying Markov's inequality to ([IB]) we have 1 > ^o<k<K^(-Xd.,r = k) > 
1 — 1/K > 1 — e. Then the same applies to the limits as d ^ oo. The assertion then follows. 

The proof for the matching case is similar. D 

The recursion properties p^ which are used to define {X^ ,.}; {^d,r} carry on to XQo,r, ^oo,r) which, 
as a result, satisfy recursive distributional equations. 

Lemma 10. The following equality in distribution takes place 

Yoo,r = ({Yoo,r + l)B{Y^,r + 1)^ ' 



Proof. Applying Lemma O for each A; > 0, 

P(Xoo,r = k)= lim W{Xd,r = k) 



d-^oo 

(ki,...,kr)l<l<r 
{ki,...,kr) l<l<r 



where the sums are over all partitions {ki, ..., kr), ki > 0, ^i<:,Kr ki = k — 1. But the last expression is 

exactly the probability that {Xoo\r + l)B{XQo\r + 1) takes value k. The assertion then follows. A similar 
argument shows the identity for Yoo,r- D 



4.4 Solving for the distribution of Xd^r and 1^ ^ 

We now show that (/iXoo r i^) ^^^ ^Yca ,■ (^) have a very simple explicit form. We first show that they 
satisfy simple differential equations. 
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Lemma 11. For every s G [0, 1) 



d i 



Proof. We first prove the identity for Xao,r- Applying Lemma [TOl 

oo ^ 

^X^M = nXoo,r = 0) + 5^ ---.^•+1P(XW, = k). 



fc=0 



Thus since the p.g.f. of any non-negative integer-valued r.v. is differentiable on [0,1), and can be 
differentiated term-by-term, we obtain 



d 



7 OO -. 



fc=0 

oo 

= Y. s¥(xg, = k) 

1^ 

As for for Yoo,r we have from Lemma [TOl that (py {s) is equal to the p.g.f. of {Yoo,r + ^)B{Yoo,r + !)• 
Therefore 

fe=0 

D 
We now solve for the p.g.f. of Xoo,r, i^oo.r. 

Proposition 2. For every s E [0, 1], 

]_ 

'/'X«,,.(s) = (r-(r-l)s) -i 

0yoo.r(s) = (r - (r - l)sy^^. 
Proof. Applying the chain rule and Lemma [TT] 

^/~Jr'\s) = -(r- l)(0x^,,(.))-^0x^,,(.) 

= -(r-l). 

]_ 

We conclude that (pXoo.ri^) — (c— (r— l)s) '— i for some constant c for all s G [0, 1). Since limsji 4>Xoo.ri^) - 

1 
0Xoo,r(l) = 1) we conclude that c = r and therefore 4'Xoo,ri^) = (r — (r — l)s) '— i and the required 

identity is established. 
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Similarly, we find 

= -(r-l)(<Ay^_,(.))-Vyo...(s) 
= -(r-l). 

Using this and 4>Yao,r^^ — 1 the required identity is established. D 

4.5 Proofs of Theorems [1] and [2] 

We now have all the necessary results to complete the proofs of our main theorems. 

Proof of TheoremUl Applying the last part of Lemma[9]and the Dominated Convergence Theorem (see 
|Dur96| ) we have that ilW = F, then 

Here W serves as a dominating random variable. Applying Proposition [2] the right-hand side of this 
expression equals 

E[W{r-l-{r-2)F{W))~^]= x({r-l)-{r-2)F{x)] "'^ f{x)dx. 

Now observe that E[W'[T^]] = Eiey(G) ^[^^^(^ ^ ^^)]- Applying part i^ of Theorem [71 Lemma El 
and letting d — > oo we obtain the result. D 

Proof of TheoremlB Observe that E[W"[A^^]] = J2eeE{G)^i^eI(.e e MQ)]. The rest of the proof is 
similar to the case for independent sets. D 

5 The variance of GREEDY 

In this section we prove our main results on the variance of GREEDY. 
Proof of TheoremlM Since W[ig] = Y.i(.v ^*-^(^ ^ ^^)' ^^ have 

Var{W[ig]) = Y, [nWiWjI{i,j G IQ)] - K[Wa{i € ig)]E[WjI{j G IG)] 

= ^ (E[Wll{i G ig)] - {E[WJ{i G ig)])^) 



+ Y.Y. Y. {nw,Wji{i,j G ig)] - E[Wii{i G ig)]E[Wji{j G ig)] 

i€Vd>OjeNa+i{i)\Na{i) 



< nElW^] 



+ EE E {E[w,Wjiii,jeig)]-E[Wii{ieig)]E[Wji{jeig)]y (i7) 

ieVd>Oj(^Na+i{i)\Naii) 
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Our proof approach is to show that the terms in parenthesis are sufficiently close to each other, provided 
that the distance between nodes i and j is sufficiently large. 

Fix an arbitrary i ^V and j € N(i+i{i) \ N(i{i) for d>2. Recall the notion of the influence blocking 
subgraph from Section [3l Denote IB{N{i)) and IB{N{j)) by Hi and Hj for short. Let / = [d/2\ — 1. 
Consider the event £ = {Hi C Ni{i) A Hj C Niij))". 

We have 

E[WiWjI{iJ G ig)] = EmW,I{i,j G ig,H, C Ni{i),Hj C Ni{j))] 

+ E[W^WjI{i,jeig)Ii£)]. (18) 

We first analyze the second summand. 

E[W^WjI{i,j G ig)I{8)] < E[W^WJIi£)] 

< E[W^Wjil - I{H, C Niii)))] + E[WiWj{l - I{Hj C Niij)))] 

= E[W,{1 - I{Hi C Ni{i)))]E[Wj]+E[Wj{l - I{Hj C Ni{j)))]Em] 

where the equality holds since both Wi and the event Hi C Ni{i) depend only on the weight configuration 
inside Ni^i{i) which does not contain node j and vice verse. Next, applying the second part of LemmaH] 
and Lemma [6] we have 

E[Wi{l - I{Hi C Niii)))] = E[Wi\{l - F{H, C Ni{i))) 

< E[Tyi]r(r - 1)7(/ + 1)!. (19) 

Thus we obtain 

E[W^WjIii,j G ig)I{£)] < 2E[Wi]E[Wj]r(r - if /{I + 1)! 

= 2E[Ty]V(r - 1)7(/ + 1)! (20) 

We now analyze the first summand in (|18p . Let Hi = HiriNi{i), Hj = Hjr)Ni{j). Namely, Hi and Hj are 
the subgraphs of Ni{i) and Ni{j) induced by nodes V{Hi)nV{Ni{i)) and V{Hj)nV{Ni{j)), respectively. 
Observe that the random variables Wil{i G ig{Hi),Hi = Hi) and Wjl{j G ig{Hj),Hj = Hj) are 
independent. Indeed, since I{Hi = Hi) = I{Hi G Ni{i)) and I{Hj = Hj) = I{Hj G Ni{j)), they 
are completely determined by the weights inside Ni^i{i) and Ni-^-i{j) (respectively) and those do not 
intersect. Therefore 

E[WiWjI{i G ig{Hi),H^ = H„j G ig{Hj),H, = H,)] 

= E[WiI{i G ig{Hi),Hi = Hi)]E[WjI{j G ig{Hj),Hj = Hj)] (21) 

On the other hand 

I{i G ig{Hi),H, = Hi) = I{i G Tg{Hi),Hi C Ni{i)) 

= I{ieig,HiCNiii)) 

where the second equality follow from Lemma El Similarly we obtain 

I{j G ig{Hj),Hj = Hj) = I{j G ig,Hj C Niij)). 
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Thus, we can rewrite (1211) as 



E[WiW,Iii,j G ig,H, C Nii{},Hj C Ni{j))] 

= E[WJ{i G ig, Hi C Ni{i))]E[WjI{j G IQ, H, C iV/(j))]. 



We recognize the left-hand side of this equation as the first sumniand in (jlSp . Returning to (jlSp we 
obtain 

E[w,w,-/(i,i G ig)] - K[wa{i G ig, h, c iV/(i))]E[i^,-j(j G ig, Hj c ivKi))] 

<2E[VF]V(r - 1)7(/ + 1)!. (22) 

Also we have 

K[W^I{i G ig)] = E[W^Iii G ig, H, C iVKO)] + nWiI{i G 2:g(F,))(l - /(//i C Ni{i)))], 
and 

E[Ty,/(i G Tg{m)){l - I{H^ C iVKO))] < nW^{l - I{H, C Ni{i)))] < K[W]r{r - 1)7(/ + 1)!, 
where the second inequality is (|19p . It follows 

E[WiI{i G Zg)] - E[Tyi/(i G ig,Hi C iVz(i))] < min(E[iy]r(r - 1)7(/ + l)!,E[Wi/(i G IC?)]). 
A similar inequality holds for j. Putting these two bounds together we obtain 

K[Wa{i G ig)]E[WjI{j G ig)] - ElWJii G ig, H, C Ni{i))]E[WjlU G Zg, i/j C Ni{j))] 

< 2E[VF]V(r - 1)7(/ + 1)!, 

where trivial bounds E[WiI{i G J^)] < E[VFi],E[VFj/(i G ig)] < E[Wj] are used. Combining with 
bound (1221) we obtain 

'E[w^Wjiii,j G ig)] - E[w^iii G 2:g)]E[Ty,-/(j G ig)] 

<AE[Wfr{r-iy/{l + l)\ 
i+i 



r-\ 



E[Ty^](r-l)'+7(/ + l)!. 



We now use this estimate in ([IT]). Observe that |iVd+i(i) \iVd(i)| < r(r- 1)'^. Recah that / + 1 = \dl2\. 
Then for each i, considering the cases of odd and even d separately and observing that the estimate is 
also trivially an upper bound for the d = 0, 1 cases, the double sum ^d>Q)li]^N (i)\N (i) ^^ C3) is 
upper bounded by 



r-1 ^ ^^ ^ ' i\d2\)\ r-1 ^ ^^ k\ r-1 ^ ^^ k\ 

d>0 ^'- ' -'^ fe>0 fc>0 



\3fc+l 



< 8^nw'] J2 



(r-1) 



3fc+l 



k\ 



fc>0 

<8E[W^yexp{{r-lf). 
Our final upper bound on Var{W[ig]) becomes 

nE[W'^] + 8nE[W'^]r'^ exp((r - 1)^) < 9nE[W'^]r'^ exp((r - 1)^) 
This completes the proof. The proof for matchings follows similarly, and is omitted. 



D 
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6 Numerical results 

In this section we numerically evaluate the performance of GREEDY in several settings, and compare 
our results to the prior work. We first compare our bound, marked NEW in the table below, on the 
cardinality (normalized by the number of nodes) of a MIS in an r-regular graph of girth at least g 
(Corollary H]) to the previous bounds in ^She91j and [LWj . The bounds of |She91] are coming from their 
Theorem 3 (when g < 127) and their Theorem 4 (when g > 127), with Wi = 1 for all i (their formulas 
involve a notion of weighted girth). The bounds of [LW| are coming from their Table 2. Omitted values 
are those for which no corresponding results are given or the given bounds are trivial. Certain values of 
the form 2A; + 3 are emphasized to be compatible with the Table 2 given in [LWj . All values are rounded 
up to the nearest thousandth. As we see our new bounds are the strongest for many calculated values 



Table 1: Comparison of bounds for the cardinality of MIS in r-regular large-girth graphs 



,x 


5 


7 


10 




NEW 


|She91| 


|LW| 


NEW 


[She9l| |LW| 


NEW 


|She9l| 


|LW| 


50 


.302 


.288 


- 


.256 


.239 


.160 


.194 


- 


100 


.302 


.294 


- 


.256 


.243 


.211 


.197 


- 


203 


.302 


.304 


.262 


.256 


.250 


.211 


.201 


.169 


403 


.302 


.306 


.277 


.256 


.251 


.211 


.202 


.184 


2003 


.302 


.308 


.294 


.256 


.252 


.211 


.203 


.202 



of g and r > 7. Recall that for r > 7, our bounds are asymptotically (as g — > oo) equivalent to those of 
[LWj . and superior to those of |She91j . Note that our bounds converge to their limit much faster than 
the bounds of [She91| and p7] . 
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We now give our bounds for the cardinality of a MM (also normalized by the number of nodes) in 
an r-regular graph of girth at least g (Corollary [H). These are the first results for MM in this setting. 

Table 2: Bounds for the cardinality of MM in r-regular large-girth graphs 



,.v 


3 


4 


5 


6 


7 


10 


13 


25 


.437 


.427 


- 


- 


- 


- 


- 


40 


.438 


.444 


.450 


.454 


.424 


- 


- 


50 


.438 


.444 


.450 


.455 


.459 


- 


- 


75 


.438 


.444 


.450 


.455 


.459 


.468 


.449 


100 


.438 


.444 


.450 


.455 


.459 


.468 


.473 



Note that as r increases, the asymptotic (in r) size of a MM approaches that of a perfect matching 
(^), as expected from Corollary [5l 

We now give our results for MWIS and MWM with i.i.d Exp(l) (exponentially distributed with 
parameter 1) weights, and compare to the results given in |GNS) . The GREEDY columns show the 
expected asymptotic weight (normalized by the number of nodes) of the weighted independent set and 
matching returned by GREEDY as given in Theorems [1] and [21 while the [GNS] columns reflect the 
expected asymptotic weight of a true MWIS and MWM as computed in [GNS] . We only give results 
for r-regular graph with limiting girth, as no results for fixed girth are given in [GNS] . 



Table 3: Exact MWIS and MWM vs. GREEDY for r-regular large-girth graphs with i.i.d Exp(l) 
weights 





MWIS 


MWM 


r 


[GNS] GREEDY 


[GNSJ GREEDY 


3 

4 
5 
10 


.6077 .5966 

.5631^ .5493 

.5119 

.3967 


.7980 .7841 
.9022 .8826 
.9886 .9643 
1.282 1.242 



In all cases, GREEDY is nearly optimal. 

7 Conclusion 

We have provided new results for the performance of a simple randomized greedy algorithm for finding 
large independent sets and matchings in regular graphs with large finite girth. This provided new 
constructive and existential results in several settings. One of the interesting insights from this work is 
demonstrating a correlation decay property of the greedy algorithm which aids greatly the analysis of 
this algorithm. In addition to several explicit bounds on the sizes and weights of independent sets and 
matchings produced by the GREEDY algorithm, we established concentration results by bounding the 
variance of the values produced by GREEDY. As a byproduct, this shows that GREEDY is very 
robust w.r.t. random choices: running it several times will produce roughly the same result in terms of 
the cardinality of the produced independent set and matching. 



^reported incorrectly in [GNS] . 
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